Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 10104 |
| Missing cells (%) | 5.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 MiB |
| Average record size in memory | 144.0 B |
Variable types
| NUM | 10 |
|---|---|
| CAT | 8 |
property_type has constant value "10000" | Constant |
country has constant value "10000" | Constant |
city has constant value "10000" | Constant |
current_zones has a high cardinality: 137 distinct values | High cardinality |
zone has a high cardinality: 132 distinct values | High cardinality |
current_zones has 545 (5.4%) missing values | Missing |
zone has 545 (5.4%) missing values | Missing |
closed_price has 9014 (90.1%) missing values | Missing |
interior_area is highly skewed (γ1 = 35.28555918) | Skewed |
gros_area is highly skewed (γ1 = 36.8104142) | Skewed |
year_of_construction is highly skewed (γ1 = 80.05373257) | Skewed |
propertiesid has unique values | Unique |
interior_area has 953 (9.5%) zeros | Zeros |
gros_area has 1102 (11.0%) zeros | Zeros |
bedrooms has 424 (4.2%) zeros | Zeros |
bathrooms has 545 (5.5%) zeros | Zeros |
other_rooms has 9005 (90.0%) zeros | Zeros |
year_of_construction has 8029 (80.3%) zeros | Zeros |
year_of_renovation has 9960 (99.6%) zeros | Zeros |
Reproduction
| Analysis started | 2021-05-25 17:13:57.922380 |
|---|---|
| Analysis finished | 2021-05-25 17:14:19.609878 |
| Duration | 21.69 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 402203.9997 |
|---|---|
| Minimum | 86536 |
| Maximum | 948301 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 86536 |
|---|---|
| 5-th percentile | 88214.95 |
| Q1 | 95806.75 |
| median | 399785 |
| Q3 | 676641.75 |
| 95-th percentile | 905132.1 |
| Maximum | 948301 |
| Range | 861765 |
| Interquartile range (IQR) | 580835 |
Descriptive statistics
| Standard deviation | 303062.6713 |
|---|---|
| Coefficient of variation (CV) | 0.7535048671 |
| Kurtosis | -1.401180218 |
| Mean | 402203.9997 |
| Median Absolute Deviation (MAD) | 303153.5 |
| Skewness | 0.3223707184 |
| Sum | 4022039997 |
| Variance | 9.184698275e+10 |
| Monotocity | Strictly decreasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 102398 | 1 | < 0.1% | |
| 101691 | 1 | < 0.1% | |
| 378186 | 1 | < 0.1% | |
| 89417 | 1 | < 0.1% | |
| 91214 | 1 | < 0.1% | |
| 874821 | 1 | < 0.1% | |
| 685380 | 1 | < 0.1% | |
| 99435 | 1 | < 0.1% | |
| 435518 | 1 | < 0.1% | |
| 826681 | 1 | < 0.1% | |
| Other values (9990) | 9990 | 99.9% |
| Value | Count | Frequency (%) | |
| 86536 | 1 | < 0.1% | |
| 86543 | 1 | < 0.1% | |
| 86545 | 1 | < 0.1% | |
| 86546 | 1 | < 0.1% | |
| 86547 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 948301 | 1 | < 0.1% | |
| 948275 | 1 | < 0.1% | |
| 948219 | 1 | < 0.1% | |
| 947861 | 1 | < 0.1% | |
| 947596 | 1 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Apartment |
|---|
| Value | Count | Frequency (%) | |
| Apartment | 10000 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
property_status
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Used | |
|---|---|
| New | |
| Under Construction | 460 |
| Under construction | 90 |
| In project | 10 |
| Other values (4) | 15 |
| Value | Count | Frequency (%) | |
| Used | 5964 | 59.6% | |
| New | 3461 | 34.6% | |
| Under Construction | 460 | 4.6% | |
| Under construction | 90 | 0.9% | |
| In project | 10 | 0.1% | |
| Remodelled | 6 | 0.1% | |
| Refurbished | 5 | 0.1% | |
| For refurbishment | 3 | < 0.1% | |
| To demolish or rebuild | 1 | < 0.1% |
Frequencies of value counts
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 22 |
|---|---|
| Median length | 4 |
| Mean length | 4.4427 |
| Min length | 3 |
availability
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Withdrawn | |
|---|---|
| Available | |
| Sold | |
| In evaluation | 266 |
| WithDrawn | 162 |
| Other values (3) | 50 |
| Value | Count | Frequency (%) | |
| Withdrawn | 6455 | 64.5% | |
| Available | 2071 | 20.7% | |
| Sold | 996 | 10.0% | |
| In evaluation | 266 | 2.7% | |
| WithDrawn | 162 | 1.6% | |
| Reserved | 34 | 0.3% | |
| In negotiation | 8 | 0.1% | |
| Rented | 8 | 0.1% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 9 |
| Mean length | 8.6066 |
| Min length | 4 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Albania |
|---|
| Value | Count | Frequency (%) | |
| Albania | 10000 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
division
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Tirana | |
|---|---|
| Berat | 1 |
| Budva | 1 |
| Value | Count | Frequency (%) | |
| Tirana | 9998 | > 99.9% | |
| Berat | 1 | < 0.1% | |
| Budva | 1 | < 0.1% |
Frequencies of value counts
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.9998 |
| Min length | 5 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Tirana |
|---|
| Value | Count | Frequency (%) | |
| Tirana | 10000 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
| Distinct | 137 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 545 |
| Missing (%) | 5.4% |
| Memory size | 78.1 KiB |
| Fresku | 521 |
|---|---|
| Komuna e Parisit | 493 |
| 21 Dhjetori | 468 |
| Astiri | 467 |
| Don Bosco | 373 |
| Other values (132) |
| Value | Count | Frequency (%) | |
| Fresku | 521 | 5.2% | |
| Komuna e Parisit | 493 | 4.9% | |
| 21 Dhjetori | 468 | 4.7% | |
| Astiri | 467 | 4.7% | |
| Don Bosco | 373 | 3.7% | |
| Ali Demi | 364 | 3.6% | |
| Ish Blloku | 333 | 3.3% | |
| Liqeni i Thatë | 329 | 3.3% | |
| Rruga e Kavajës | 257 | 2.6% | |
| Yzberish | 220 | 2.2% | |
| Other values (127) | 5630 | 56.3% | |
| (Missing) | 545 | 5.5% |
Frequencies of value counts
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 0.2% |
Histogram of lengths of the category
Length
| Max length | 37 |
|---|---|
| Median length | 11 |
| Mean length | 12.0128 |
| Min length | 3 |
| Distinct | 132 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 545 |
| Missing (%) | 5.4% |
| Memory size | 78.1 KiB |
| Fresku | 521 |
|---|---|
| Komuna e Parisit | 493 |
| 21 Dhjetori | 468 |
| Astiri | 467 |
| Don Bosco | 373 |
| Other values (127) |
| Value | Count | Frequency (%) | |
| Fresku | 521 | 5.2% | |
| Komuna e Parisit | 493 | 4.9% | |
| 21 Dhjetori | 468 | 4.7% | |
| Astiri | 467 | 4.7% | |
| Don Bosco | 373 | 3.7% | |
| Ali Demi | 364 | 3.6% | |
| Ish Blloku | 333 | 3.3% | |
| Liqeni i Thatë | 329 | 3.3% | |
| Kodra e Diellit Residence | 268 | 2.7% | |
| Rruga e Kavajës | 257 | 2.6% | |
| Other values (122) | 5582 | 55.8% | |
| (Missing) | 545 | 5.5% |
Frequencies of value counts
Unique
| Unique | 19 ? |
|---|---|
| Unique (%) | 0.2% |
Histogram of lengths of the category
Length
| Max length | 37 |
|---|---|
| Median length | 11 |
| Mean length | 12.1176 |
| Min length | 3 |
price
Real number (ℝ≥0)
| Distinct | 1258 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 105512.7118 |
|---|---|
| Minimum | 0 |
| Maximum | 3823648 |
| Zeros | 40 |
| Zeros (%) | 0.4% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 41000 |
| Q1 | 62825 |
| median | 85000 |
| Q3 | 121743.75 |
| 95-th percentile | 235000 |
| Maximum | 3823648 |
| Range | 3823648 |
| Interquartile range (IQR) | 58918.75 |
Descriptive statistics
| Standard deviation | 94015.19219 |
|---|---|
| Coefficient of variation (CV) | 0.8910319013 |
| Kurtosis | 429.4628736 |
| Mean | 105512.7118 |
| Median Absolute Deviation (MAD) | 27000 |
| Skewness | 14.09199882 |
| Sum | 1055127118 |
| Variance | 8838856363 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 75000 | 228 | 2.3% | |
| 65000 | 206 | 2.1% | |
| 85000 | 204 | 2.0% | |
| 80000 | 179 | 1.8% | |
| 55000 | 173 | 1.7% | |
| 60000 | 168 | 1.7% | |
| 90000 | 168 | 1.7% | |
| 110000 | 160 | 1.6% | |
| 70000 | 158 | 1.6% | |
| 100000 | 146 | 1.5% | |
| Other values (1248) | 8210 | 82.1% |
| Value | Count | Frequency (%) | |
| 0 | 40 | 0.4% | |
| 1 | 3 | < 0.1% | |
| 77.3 | 1 | < 0.1% | |
| 86 | 1 | < 0.1% | |
| 100 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3823648 | 1 | < 0.1% | |
| 3200000 | 1 | < 0.1% | |
| 2500000 | 1 | < 0.1% | |
| 2000000 | 1 | < 0.1% | |
| 1050000 | 1 | < 0.1% |
| Distinct | 270 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.0103 |
|---|---|
| Minimum | 0 |
| Maximum | 5700 |
| Zeros | 953 |
| Zeros (%) | 9.5% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 63 |
| median | 86 |
| Q3 | 106 |
| 95-th percentile | 147 |
| Maximum | 5700 |
| Range | 5700 |
| Interquartile range (IQR) | 43 |
Descriptive statistics
| Standard deviation | 102.636134 |
|---|---|
| Coefficient of variation (CV) | 1.179586027 |
| Kurtosis | 1654.606222 |
| Mean | 87.0103 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 35.28555918 |
| Sum | 870103 |
| Variance | 10534.17601 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 953 | 9.5% | |
| 100 | 187 | 1.9% | |
| 90 | 180 | 1.8% | |
| 95 | 168 | 1.7% | |
| 94 | 167 | 1.7% | |
| 80 | 159 | 1.6% | |
| 75 | 148 | 1.5% | |
| 60 | 148 | 1.5% | |
| 85 | 144 | 1.4% | |
| 84 | 141 | 1.4% | |
| Other values (260) | 7605 | 76.0% |
| Value | Count | Frequency (%) | |
| 0 | 953 | 9.5% | |
| 13 | 1 | < 0.1% | |
| 15 | 4 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 20 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5700 | 1 | < 0.1% | |
| 5000 | 1 | < 0.1% | |
| 3746 | 1 | < 0.1% | |
| 2777 | 1 | < 0.1% | |
| 2200 | 1 | < 0.1% |
| Distinct | 303 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.9963 |
|---|---|
| Minimum | -2 |
| Maximum | 7600 |
| Zeros | 1102 |
| Zeros (%) | 11.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 68 |
| median | 94 |
| Q3 | 115 |
| 95-th percentile | 165 |
| Maximum | 7600 |
| Range | 7602 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 135.222345 |
|---|---|
| Coefficient of variation (CV) | 1.423448545 |
| Kurtosis | 1718.279534 |
| Mean | 94.9963 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | 36.8104142 |
| Sum | 949963 |
| Variance | 18285.08259 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 1102 | 11.0% | |
| 100 | 211 | 2.1% | |
| 110 | 203 | 2.0% | |
| 105 | 176 | 1.8% | |
| 75 | 172 | 1.7% | |
| 120 | 165 | 1.7% | |
| 90 | 149 | 1.5% | |
| 70 | 148 | 1.5% | |
| 95 | 143 | 1.4% | |
| 115 | 139 | 1.4% | |
| Other values (293) | 7392 | 73.9% |
| Value | Count | Frequency (%) | |
| -2 | 1 | < 0.1% | |
| 0 | 1102 | 11.0% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7600 | 1 | < 0.1% | |
| 6450 | 1 | < 0.1% | |
| 5000 | 1 | < 0.1% | |
| 3746 | 1 | < 0.1% | |
| 3500 | 1 | < 0.1% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8157 |
|---|---|
| Minimum | 0 |
| Maximum | 21 |
| Zeros | 424 |
| Zeros (%) | 4.2% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8407164745 |
|---|---|
| Coefficient of variation (CV) | 0.4630260916 |
| Kurtosis | 38.02335821 |
| Mean | 1.8157 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.303387857 |
| Sum | 18157 |
| Variance | 0.7068041904 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) | |
| 2 | 5431 | 54.3% | |
| 1 | 2687 | 26.9% | |
| 3 | 1336 | 13.4% | |
| 0 | 424 | 4.2% | |
| 4 | 93 | 0.9% | |
| 6 | 7 | 0.1% | |
| 5 | 7 | 0.1% | |
| 11 | 6 | 0.1% | |
| 8 | 5 | 0.1% | |
| 7 | 2 | < 0.1% | |
| Other values (2) | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 424 | 4.2% | |
| 1 | 2687 | 26.9% | |
| 2 | 5431 | 54.3% | |
| 3 | 1336 | 13.4% | |
| 4 | 93 | 0.9% |
| Value | Count | Frequency (%) | |
| 21 | 1 | < 0.1% | |
| 11 | 6 | 0.1% | |
| 10 | 1 | < 0.1% | |
| 8 | 5 | 0.1% | |
| 7 | 2 | < 0.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3635 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 545 |
| Zeros (%) | 5.5% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.6201662766 |
|---|---|
| Coefficient of variation (CV) | 0.4548340862 |
| Kurtosis | 2.556597982 |
| Mean | 1.3635 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.3820484973 |
| Sum | 13635 |
| Variance | 0.3846062106 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) | |
| 1 | 5430 | 54.3% | |
| 2 | 3909 | 39.1% | |
| 0 | 545 | 5.5% | |
| 3 | 94 | 0.9% | |
| 4 | 14 | 0.1% | |
| 6 | 7 | 0.1% | |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 545 | 5.5% | |
| 1 | 5430 | 54.3% | |
| 2 | 3909 | 39.1% | |
| 3 | 94 | 0.9% | |
| 4 | 14 | 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 1 | < 0.1% | |
| 6 | 7 | 0.1% | |
| 4 | 14 | 0.1% | |
| 3 | 94 | 0.9% | |
| 2 | 3909 | 39.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1226 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 9005 |
| Zeros (%) | 90.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4190307849 |
|---|---|
| Coefficient of variation (CV) | 3.417869371 |
| Kurtosis | 31.61866298 |
| Mean | 0.1226 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.744567268 |
| Sum | 1226 |
| Variance | 0.1755867987 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) | |
| 0 | 9005 | 90.0% | |
| 1 | 844 | 8.4% | |
| 2 | 92 | 0.9% | |
| 3 | 44 | 0.4% | |
| 4 | 11 | 0.1% | |
| 6 | 2 | < 0.1% | |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 9005 | 90.0% | |
| 1 | 844 | 8.4% | |
| 2 | 92 | 0.9% | |
| 3 | 44 | 0.4% | |
| 4 | 11 | 0.1% |
| Value | Count | Frequency (%) | |
| 6 | 2 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 4 | 11 | 0.1% | |
| 3 | 44 | 0.4% | |
| 2 | 92 | 0.9% |
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 414.2323 |
|---|---|
| Minimum | 0 |
| Maximum | 199636 |
| Zeros | 8029 |
| Zeros (%) | 80.3% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2020 |
| Maximum | 199636 |
| Range | 199636 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2146.456291 |
|---|---|
| Coefficient of variation (CV) | 5.181769483 |
| Kurtosis | 7423.197993 |
| Mean | 414.2323 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 80.05373257 |
| Sum | 4142323 |
| Variance | 4607274.61 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 8029 | 80.3% | |
| 2021 | 399 | 4.0% | |
| 2020 | 266 | 2.7% | |
| 2010 | 178 | 1.8% | |
| 2005 | 101 | 1.0% | |
| 2019 | 90 | 0.9% | |
| 2008 | 71 | 0.7% | |
| 2012 | 61 | 0.6% | |
| 2000 | 61 | 0.6% | |
| 2015 | 59 | 0.6% | |
| Other values (52) | 685 | 6.9% |
| Value | Count | Frequency (%) | |
| 0 | 8029 | 80.3% | |
| 1 | 3 | < 0.1% | |
| 2 | 3 | < 0.1% | |
| 70 | 1 | < 0.1% | |
| 85 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 199636 | 1 | < 0.1% | |
| 2024 | 1 | < 0.1% | |
| 2023 | 2 | < 0.1% | |
| 2022 | 37 | 0.4% | |
| 2021 | 399 | 4.0% |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.0658 |
|---|---|
| Minimum | 0 |
| Maximum | 2021 |
| Zeros | 9960 |
| Zeros (%) | 99.6% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2021 |
| Range | 2021 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 127.2828864 |
|---|---|
| Coefficient of variation (CV) | 15.78056564 |
| Kurtosis | 245.1325521 |
| Mean | 8.0658 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.71884779 |
| Sum | 80658 |
| Variance | 16200.93316 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=13)
| Value | Count | Frequency (%) | |
| 0 | 9960 | 99.6% | |
| 2019 | 11 | 0.1% | |
| 2018 | 6 | 0.1% | |
| 2021 | 5 | 0.1% | |
| 2015 | 4 | < 0.1% | |
| 2017 | 4 | < 0.1% | |
| 2010 | 3 | < 0.1% | |
| 2020 | 2 | < 0.1% | |
| 2005 | 1 | < 0.1% | |
| 2009 | 1 | < 0.1% | |
| Other values (3) | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 9960 | 99.6% | |
| 2000 | 1 | < 0.1% | |
| 2005 | 1 | < 0.1% | |
| 2008 | 1 | < 0.1% | |
| 2009 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2021 | 5 | 0.1% | |
| 2020 | 2 | < 0.1% | |
| 2019 | 11 | 0.1% | |
| 2018 | 6 | 0.1% | |
| 2017 | 4 | < 0.1% |
| Distinct | 246 |
|---|---|
| Distinct (%) | 24.9% |
| Missing | 9014 |
| Missing (%) | 90.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76884.52333 |
|---|---|
| Minimum | 150 |
| Maximum | 870989 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 150 |
|---|---|
| 5-th percentile | 34000 |
| Q1 | 50625 |
| median | 65000 |
| Q3 | 87000 |
| 95-th percentile | 145000 |
| Maximum | 870989 |
| Range | 870839 |
| Interquartile range (IQR) | 36375 |
Descriptive statistics
| Standard deviation | 55302.67813 |
|---|---|
| Coefficient of variation (CV) | 0.7192953242 |
| Kurtosis | 57.2081481 |
| Mean | 76884.52333 |
| Median Absolute Deviation (MAD) | 17500 |
| Skewness | 5.704295165 |
| Sum | 75808140 |
| Variance | 3058386209 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 60000 | 28 | 0.3% | |
| 65000 | 26 | 0.3% | |
| 70000 | 25 | 0.2% | |
| 45000 | 23 | 0.2% | |
| 53000 | 19 | 0.2% | |
| 67000 | 19 | 0.2% | |
| 55000 | 17 | 0.2% | |
| 100000 | 16 | 0.2% | |
| 40000 | 16 | 0.2% | |
| 56000 | 16 | 0.2% | |
| Other values (236) | 781 | 7.8% | |
| (Missing) | 9014 | 90.1% |
| Value | Count | Frequency (%) | |
| 150 | 1 | < 0.1% | |
| 230 | 1 | < 0.1% | |
| 240 | 2 | < 0.1% | |
| 250 | 1 | < 0.1% | |
| 400 | 6 | 0.1% |
| Value | Count | Frequency (%) | |
| 870989 | 1 | < 0.1% | |
| 530000 | 1 | < 0.1% | |
| 470000 | 2 | < 0.1% | |
| 435000 | 1 | < 0.1% | |
| 400000 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| propertiesid | property_type | property_status | availability | country | division | city | current_zones | zone | price | interior_area | gros_area | bedrooms | bathrooms | other_rooms | year_of_construction | year_of_renovation | closed_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 948301 | Apartment | Used | Available | Albania | Tirana | Tirana | Ish Ekspozita | Ish Ekspozita | 67525.0 | 42 | 42 | 1 | 1 | 0 | 1975 | 0 | NaN |
| 1 | 948275 | Apartment | Used | Available | Albania | Tirana | Tirana | Oxhaku | Oxhaku | 59000.0 | 84 | 84 | 2 | 1 | 0 | 1985 | 0 | NaN |
| 2 | 948219 | Apartment | Used | Available | Albania | Tirana | Tirana | Zogu I Zi | Zogu I Zi | 81500.0 | 90 | 90 | 2 | 2 | 0 | 2019 | 0 | NaN |
| 3 | 947861 | Apartment | Used | Available | Albania | Tirana | Tirana | Astiri | Astiri | 60000.0 | 68 | 79 | 1 | 1 | 0 | 2015 | 0 | NaN |
| 4 | 947596 | Apartment | Used | Available | Albania | Tirana | Tirana | Institut Kamëz | Institut Kamëz | 73450.0 | 96 | 114 | 2 | 1 | 0 | 2015 | 0 | NaN |
| 5 | 947569 | Apartment | New | Available | Albania | Tirana | Tirana | Rruga e Elbasanit | Rruga e Elbasanit | 235000.0 | 204 | 215 | 3 | 2 | 0 | 2016 | 0 | NaN |
| 6 | 947536 | Apartment | Used | Available | Albania | Tirana | Tirana | Institut Kamëz | Institut Kamëz | 44200.0 | 60 | 69 | 1 | 1 | 0 | 2015 | 0 | NaN |
| 7 | 947462 | Apartment | New | Available | Albania | Tirana | Tirana | Hipoteka | Hipoteka | 179000.0 | 118 | 128 | 2 | 2 | 0 | 2018 | 0 | NaN |
| 8 | 946773 | Apartment | Used | In evaluation | Albania | Tirana | Tirana | Tregu Elektrik | Tregu Elektrik | 650000.0 | 404 | 486 | 3 | 3 | 0 | 0 | 0 | NaN |
| 9 | 946757 | Apartment | Used | Available | Albania | Tirana | Tirana | Stadiumi Dinamo | Stadiumi Dinamo | 176000.0 | 108 | 126 | 3 | 2 | 1 | 1993 | 0 | NaN |
Last rows
| propertiesid | property_type | property_status | availability | country | division | city | current_zones | zone | price | interior_area | gros_area | bedrooms | bathrooms | other_rooms | year_of_construction | year_of_renovation | closed_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | 86583 | Apartment | Used | Sold | Albania | Tirana | Tirana | Unaza e Re Vlorë | Unaza e Re Vlorë | 51000.0 | 82 | 93 | 2 | 0 | 0 | 0 | 0 | 51000.0 |
| 9991 | 86582 | Apartment | New | Withdrawn | Albania | Tirana | Tirana | Unaza e Re Vlorë | Unaza e Re Vlorë | 150000.0 | 120 | 270 | 2 | 2 | 0 | 0 | 0 | NaN |
| 9992 | 86581 | Apartment | New | Withdrawn | Albania | Tirana | Tirana | Unaza e Re Vlorë | Unaza e Re Vlorë | 150000.0 | 117 | 291 | 2 | 2 | 0 | 0 | 0 | NaN |
| 9993 | 86573 | Apartment | Used | Withdrawn | Albania | Tirana | Tirana | NaN | NaN | 34000.0 | 53 | 53 | 1 | 1 | 0 | 0 | 0 | NaN |
| 9994 | 86567 | Apartment | Used | Sold | Albania | Tirana | Tirana | Rruga e Kavajes | Rruga e Kavajes | 75000.0 | 100 | 0 | 2 | 1 | 1 | 0 | 0 | 72000.0 |
| 9995 | 86547 | Apartment | New | WithDrawn | Albania | Tirana | Tirana | Liqeni i Thatë | Liqeni i Thatë | 79000.0 | 110 | 118 | 2 | 2 | 0 | 0 | 0 | NaN |
| 9996 | 86546 | Apartment | Used | Sold | Albania | Tirana | Tirana | Rruga e Elbasanit | Rruga e Elbasanit | 140000.0 | 115 | 0 | 2 | 2 | 0 | 0 | 0 | 140000.0 |
| 9997 | 86545 | Apartment | New | Withdrawn | Albania | Tirana | Tirana | Kodra e Diellit | Kodra e Diellit Residence | 170000.0 | 150 | 0 | 3 | 2 | 0 | 0 | 0 | NaN |
| 9998 | 86543 | Apartment | Used | Withdrawn | Albania | Tirana | Tirana | Komuna e Parisit | Komuna e Parisit | 70000.0 | 70 | 0 | 1 | 1 | 0 | 0 | 0 | NaN |
| 9999 | 86536 | Apartment | Used | Withdrawn | Albania | Tirana | Tirana | Zogu I Zi | Zogu I Zi | 99000.0 | 101 | 109 | 2 | 2 | 0 | 0 | 0 | NaN |